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Abstract 

Background: Dynamic changes to the epigenome play a critical role in establishing and maintaining cellular 
phenotype during differentiation, but little is known about the normal methylomic differences that occur between 
functionally distinct areas of the brain. We characterized intra- and inter-individual methylomic variation across 
whole blood and multiple regions of the brain from multiple donors. 

Results: Distinct tissue-specific patterns of DNA methylation were identified, with a highly significant over- 
representation of tissue-specific differentially methylated regions (TS-DMRs) observed at intragenic CpG islands and 
low CG density promoters. A large proportion of TS-DMRs were located near genes that are differentially expressed 
across brain regions. TS-DMRs were significantly enriched near genes involved in functional pathways related to 
neurodevelopment and neuronal differentiation, including BDNF, BMP4, CACNA1A, CACA1AF, EOMES, NGFR, NUMBL, 
PCDH9, SLIT1, SLITRK1 and SHANK3. Although between-tissue variation in DNA methylation was found to greatly 
exceed between-individual differences within any one tissue, we found that some inter-individual variation was 
reflected across brain and blood, indicating that peripheral tissues may have some utility in epidemiological studies 
of complex neurobiological phenotypes. 

Conclusions: This study reinforces the importance of DNA methylation in regulating cellular phenotype across 
tissues, and highlights genomic patterns of epigenetic variation across functionally distinct regions of the brain, 
providing a resource for the epigenetics and neuroscience research communities. 
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Background 

DNA methylation is a key epigenetic mechanism 
involved in the developmental regulation of gene expres- 
sion [1], but the tissue-specific nature of DNA methyla- 
tion has not been fully characterized at a genomic level. 
Epigenetic processes control several neurobiological and 
cognitive processes, including neurogenesis and brain 
development [2], neuronal activity [3], learning and 
memory [4], drug addiction [5], neurodegeneration [6], 
and circadian rhythm [7]. The importance of DNA 
methylation in normal brain function and development 
is exemplified by the neurodevelopmental deficits 
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associated with mutations in the methyl CpG binding 
protein 2 gene (MECP2) in Rett syndrome [8], and the 
aberrant DNA methylation signatures observed in neu- 
ropsychiatry disorders, including schizophrenia and 
bipolar disorder [9]. Although gene expression analyses 
have highlighted clear transcriptomic differences across 
brain regions [10-12], current studies of tissue-specific 
DNA methylation in the brain have assessed only a 
small percentage of CpG sites in the human genome 
[13], and none has taken an unbiased methylome-wide 
approach across multiple brain regions and blood 
obtained from the same individuals. Little is known, 
therefore, about normal methylomic differences between 
functionally distinct areas of the brain and how these 
correspond to patterns observed in easily accessible per- 
ipheral tissues such as blood. In this study we used 
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methylated DNA immunoprecipitation combined with 
ultra-deep sequencing (MeDIP-seq) to profile the 
methylomic landscape across multiple dissected brain 
regions and blood obtained from multiple individuals. 
We present annotated maps of the brain methylome, 
representing a unique resource for the genomics and 
neuroscience research communities, identifying key 
regions of the genome characterized by functionally 
relevant tissue-specific DNA methylation. 

Results and discussion 

Methylomic profiling across brain and blood 

Our primary methylomic profiling experiments used 
multiple dissected brain regions (inferior frontal gyrus, 
middle frontal gyrus, entorhinal cortex, superior tem- 
poral gyrus of the temporal cortex, visual cortex, and 
cerebellum) from post-mortem brain samples obtained 
from individuals free of any neuropathology and neu- 
ropsychiatry disease. From a subset of these indivi- 
duals, whole blood samples were also obtained 
longitudinally prior to death. A detailed list of the pri- 
mary samples used in this study is given in Supple- 
mentary Table 1 in Additional file 1, Of these, 21 
tissue samples from three individuals (two female, one 
male) were initially assessed using ultra-deep paired- 
end MeDIP-seq (see Materials and methods). After 
stringent quality control (Supplementary Figure 1 in 
Additional file 1), an average of >70.4 million uniquely 
mapped 50 bp reads were obtained from each of the 
21 samples (Supplementary Table 2 in Additional file 
1); to our knowledge, this represents the largest 
between-individual and cross-tissue DNA methylation 
dataset yet produced. To generate an estimate of actual 
DNA methylation from our MeDIP-seq data, we used 
the MEDIPS analysis package [14] to control for local 
CpG density and generate DNA methylation scores for 
overlapping 500 bp bins across the genome. Bisulfite 
pyrosequencing was used to verify DNA methylation 
estimates at selected regions of the genome, and exam- 
ine base pair-specific levels of DNA methylation across 
nominated regions in additional brain and blood sam- 
ples. Normalized raw MeDIP-seq reads and MEDIPS- 
estimated absolute DNA methylation values for each 
tissue/individual combination are available as a 
resource for download and browsing as UCSC tracks 
from our laboratory website [15]. The data are also 
being integrated into the Human Epigenome Atlas 
[16,17] as part of the regular data release by NIH Epi- 
genomics Roadmap Initiative [18]. 

Genome-wide DNA methylation across cortex, cerebellum 
and peripheral blood is highly tissue-specific 

As expected, canonical genie DNA methylation profiles 
do not differ across samples or tissue types, with overall 



low average DNA methylation around the immediate 
transcription start site, high levels of DNA methylation 
across the gene body, and more subtle hypomethylation 
being observed at the 3' end of genes (Supplementary 
Figure 2 in Additional file 1), confirming previous obser- 
vations [19]. Genome-wide, however, there are striking 
tissue-specific differences in DNA methylation, with a 
clear hierarchical distinction between the six cortical 
regions, cerebellum and blood (Figure la, b). These 
broad differences reflect the known developmental path- 
ways of the three tissues; blood cells originate from the 
mesoderm, while cells of the central nervous system are 
ectodermic. Within the brain, the cerebellum develops 
from the metencephalon, whilst the cerebral cortex 
develops from the most anterior part of the neural plate 
(the telencephalon). The cortex itself is subdivided into 
numerous functionally distinct anatomical regions, spe- 
cializing in sensory, motor, and association tasks. 

DNA methylation varies across different functionally 
annotated regions of the genome 

We defined feature sets spanning i) all annotated CpG 
islands (CGIs), sub-typed by location (gene promoter, 
intragenic, 3' UTR, and intergenic) as described pre- 
viously [19], ii) CGI shores (spanning 2,000 bp up- and 
downstream of each CGI), and Hi) all annotated coding 
sequences (CDSs). We also examined tissue-specific pat- 
terns of DNA methylation across gene promoters 
defined by low, medium and high CG content (LCPs, 
ICPs, and HCPs, respectively) [20]. BED files of the fea- 
ture annotations used in this study (CGIs, CGI shores, 
CDSs, LCPs, ICPs, and HCPs) are available for down- 
load from our laboratory webpage [15]. DNA methyla- 
tion across each feature was quantified using both 
normalized MeDIP-seq read counts and MEDIPS scores; 
Figure 2a shows the average DNA methylation levels for 
each feature type. As described previously [21], CGIs are 
significantly hypomethylated compared to CGI shores 
and the gene body, with no overall difference in canoni- 
cal methylation patterns between tissues or individuals. 
There is, however, considerable heterogeneity in DNA 
methylation across different categories of CGI, depen- 
dent upon genomic location, with promoter CGIs being 
significantly hypomethylated in comparison to intra- 
genic, 3' UTR, and intergenic CGIs. CGI shores, on the 
other hand, do not vary significantly as a function of 
genomic location. Although the majority of promoters 
(approximately 60%) are associated with CGIs, not all 
promoter regions are hypomethylated; promoter methy- 
lation is inversely correlated with CG density, with LCPs 
being the most methylated of any feature type tested. In 
contrast, HCPs (which overlap considerably with promo- 
ter CGIs) are hypomethylated relative to the other 
features. 
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Figure 1 Methylomic profiling across multiple brain areas and blood from a cohort of individuals highlights clear tissue-specific 
differences in DNA methylation. (a, b) DNA methylation was calculated from ultra-deep MeDIP-seq data using 500 bp bins across the 
genome, and the relationship between tissues determined by Pearson correlation (a) and unsupervised hierarchical clustering (b). BA, Brodmann 
area; Ent Ctx, entorhinal cortex; STG, superior temporal gyrus; Vis Ctx, visual cortex. 



DNA methylation differs significantly between tissues 
across all annotated feature types 

We next examined tissue- and brain region-specific 
DNA methylation in the context of annotated gene 
features to identify regions of the genome harboring 
tissue-specific differentially methylated regions (TS- 
DMRs). Due to the overall similarity between the six 
cortical regions, we initially focused on gross differ- 
ences between cortex, cerebellum and blood. Cross-tis- 
sue normalized MeDIP-seq data for all 65,535 
annotated features can be downloaded from our 
laboratory website [15]. For each broad category of fea- 
ture, hierarchical clustering of the MeDIP-seq data can 
clearly distinguish between tissue types (Supplementary 
Figure 3 in Additional file 1). The top 50 variably 
methylated annotated features across cortex, cerebel- 
lum and blood are listed in Supplementary Table 3 in 
Additional file 1. Using Illumina 450 K HumanMethy- 
lation microarray data obtained from matched cerebel- 
lum, frontal cortex, and whole blood samples from 90 
individuals being assessed by our group as part of an 
ongoing clinical epigenetics study (Mill et al., unpub- 
lished data), we were able to confirm that the vast 
majority of these features are true TS-DMRs (Supple- 
mentary Table 4 in Additional file 1); 74% of the 206 
probes mapping to the broad genomic regions covered 



by these features were characterized by false discovery 
rate-significant intra-individual between-tissue DNA 
methylation differences in the direction predicted by 
our MeDIP-seq data. 

Ingenuity Pathway Analysis (IPA) of the most vari- 
able features (defined as those with a coefficient of 
variance (CV) >1 across the three tissue types) high- 
lights a highly significant enrichment of functional 
pathways involved in regulating developmental gene 
expression (P = 8.83 x 10" ), organismal development 
(P = 1.37 x 10~ 20 ), and tissue differentiation (P = 2.34 
x 10 20 ) (Supplementary Figure 4 in Additional file 1). 
Of note, given the origins of the samples used in this 
analysis, it is interesting that the primary tissue-speci- 
fic functional pathways enriched in the list of TS- 
DMRs are nervous system development and function 
(P = 4.60 x 10~ 21 ) and hematological system develop- 
ment and function (P = 3.61 x 10" 8 ), indicating that 
epigenetic differences are likely to be associated with 
significant phenotypic differences. Although TS-DMRs 
are distributed across all feature types, there are 
marked differences in the between-tissue correlation 
of DNA methylation across each of the broad feature 
categories we examined, with CGIs being more corre- 
lated across cortex, cerebellum and blood than CGI 
shores or CDSs (Figure 2b). 
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Figure 2 Although DNA methylation at CGIs is relatively conserved across tissues, intragenic CGIs are dramatically over-represented 
and promoter CGIs under-represented in the most tissue-variable CGIs. (a) Average DNA methylation values calculated by MEDIPS from 
MeDIP-seq data for all annotated gene features: CGIs (yellow), CGI shores (blue), gene promoters (red) and CDSs (green). DNA methylation is 
lower in promoter CGIs compared to intragenic, 3' UTR and intergenic CGIs. CGI shores are characterized by higher DNA methylation than CGIs, 
with less location-dependent variation. Promoter DNA methylation shows a strong inverse correlation with GC density, with LCPs showing a 
higher average level of DNA methylation than CDSs. Error bars represent standard error of the mean, (b) Although TS-DMRs are distributed 
across all feature types, there are marked differences in the between-tissue correlation of DNA methylation across each of the broad feature 
categories we examined, with CGIs being more correlated across cortex, cerebellum and blood than CGI shores or CDSs. (c, d) There is a highly 
significant enrichment of intragenic CGIs (P = 2 x 10~ 102 ) in analyses of CGI DMRs differentiating blood, cortex and cerebellum (c), and an even 
more dramatic enrichment (P = 1 x 1CT 246 ) in comparisons between cortex and cerebellum (d). EXP, expected; OBS, observed. 



TS-DMRs across brain regions are associated with stable 
gene expression differences and are highly enriched for 
functionally relevant neurobiological pathways 

Within the brain, the top 50 differentially methylated 
annotated features between cortical and cerebellum 
samples are listed in Supplementary Table 5 in Addi- 
tional file 1. Strikingly, many of these DMRs are asso- 
ciated with genes of known relevance to cortical and/or 
cerebellar development and function, including 
PPP2R2B (encoding a phosphatase implicated in the 
negative control of cell growth and division, whose dis- 
ruption causes cerebellar ataxia [22]), JAKMIP1/MAR- 
LIN1 (encoding a RNA-binding protein that associates 
with GABA receptors [23]), EOMES (encoding a tran- 
scriptional activator that plays a crucial role in brain 
development, controlling the proliferation of intermedi- 
ate progenitor cells and their progeny in the cerebral 
cortex [24]), GPMB6 (encoding a proteolipid widely 
expressed in neurons and in oligodendrocytes [25]), and 
GRM4 (encoding a metabotropic glutamate receptor 
with a distinct distribution in the brain, primarily 
expressed in cerebellar granule cells [26]). Furthermore, 
IPA of the top within-brain variable DMRs 



(differentiating cortex from cerebellum) highlights a pri- 
mary network involved in nervous system development 
and function (Supplementary Figure 5 in Additional file 
1), with highly significant enrichment of developmen- 
tally relevant pathways, including neurogenesis (P = 1.76 
x 10' 19 ), the guidance of neurites (P = 2.11 x 10~ 12 ), 
development of the cerebellum (P = 8.49 x 10' 5 ) and 
development of the cortex (P = 1.11 x 10 ~ 4 ) (Supple- 
mentary Table 6 in Additional file 1). 

We next used RNA extracted from matched cerebel- 
lum and frontal cortex (Brodmann area (BA)9) samples, 
obtained from an independent cohort of 42 additional 
individuals, to assess tissue-specific expression levels for 
detectable gene transcripts located in the vicinity of the 
top 50 cortex-cerebellum DMRs. The majority (82%) of 
these DMRs are mirrored by significant (P < 0.05) gene 
expression differences between cerebellum and frontal 
cortex (Supplementary Table 7 in Additional file 1), 
with 61% of genes being represented by at least one 
probe with a highly significant (P < 1 x 10' ) expression 
difference across the two brain regions. Interestingly, 
DNA methylation at these DMRs is not always nega- 
tively correlated with gene expression; EOMES, for 
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example, shows highly significant elevated expression in 
the cerebellum compared to the frontal cortex (log2 
expression values = 9.25 versus 6.46, P = 1.90 x 1CT 33 ). 
An analysis of publically available gene expression data 
[12] confirms many of these gene expression differences 
and demonstrates that many of the cortex-cerebellum 
DMRs are associated with developmentally stable gene 
expression differences between the brain regions (Sup- 
plementary Figure 6 in Additional file 1). This further 
supports the notion that the TS-DMRs identified here 
mediate functionally important differences in the cellular 
transcriptome. 

DNA methylation differs across functionally discrete 
regions of the cerebral cortex 

Although for all features tested the six cortical regions 
formed a tight cluster, distinct from both cerebellum and 
blood (Figure la, b), we were interested to see if we could 
identify DMRs that could distinguish between them. Sup- 
plementary Table 8 in Additional file 1 lists the 50 most 
variably methylated features across samples obtained 
from the frontal cortex (BA8, BA9, and BA10), entorhinal 
cortex, superior temporal gyrus, and visual cortex. While 
the magnitude of within-cortex variation is clearly lower 
than observed between average cortex and either cerebel- 
lum or blood, there are some noticeable region-specific 
patterns of DNA methylation, particularly in the visual 
cortex. This list of DMRs contains a striking number of 
genes implicated in brain function related to the cortex 
and neurodevelopment, including CACNA1A and CAC- 
NA1F (calcium-channel genes involved in neuronal 
growth and development and controlling the release of 
neurotransmitters [27]), GALNT9 (a brain-specific O-gly- 
cosylase [28]), SLC8A2/NCX2 (a sodium/calcium exchan- 
ger that has been shown to be important in synaptic 
plasticity and cortical development [29]), NUMBL 
(encoding a protein that maintains progenitor cells dur- 
ing cortical neurogenesis [30]), and GRIKS (a receptor for 
the excitory neurotansmitter glutamate [31]). IP A on loci 
associated with the top 500 across-cortex variably methy- 
lated features highlights an interactive network of genes 
involved in neurodevelopment and function (Supplemen- 
tary Figure 7 in Additional file 1), with a significant 
enrichment for functional pathways associated with neu- 
rogenesis and neuronal function, including many directly 
related to development of the cortex, such as 'forebrain 
development' (P = 6.15 x 10" 7 ) (Supplementary Table 9 
in Additional file 1). 

Although DNA methylation at promoter CGIs is strongly 
conserved across brain areas and blood, tissue-specific 
DNA methylation is particularly striking at intragenic CGIs 

The top 50 differentially methylated CGIs between cor- 
tex, cerebellum and blood are listed in Supplementary 



Table 10 in Additional file 1. Of particular interest is 
the observation that a CGI associated with BMP4, which 
encodes a protein mediating differentiation of the ecto- 
derm into the nervous plate [32], is one of the strongest 
CGI DMRs between blood and brain (cortex and cere- 
bellum). Also showing highly variable patterns of DNA 
methylation between brain tissues and blood are CGIs 
associated with other key neurodevelopmental genes, 
including BDNF (encoding a neurotrophic factor with 
an important role in neurodevelopment and neurogen- 
esis [33]) and SLITRK1 (encoding an integral membrane 
protein involved in neurite outgrowth [34]). IPA of the 
most variable (CV >1) blood versus brain CGIs reveals a 
highly significant enrichment of functional pathways 
involved in basic tissue development, including loci reg- 
ulating both neurogenesis (P = 9.88 x 10" 14 ) and hema- 
topoiesis (P = 4.59 x 10' 8 ) (Supplementary Table 11 in 
Additional file 1). Bisulfite pyrosequencing was used to 
verify selected CGI DMRs, and examine base pair-speci- 
fic levels of DNA methylation across the nominated 
regions in additional samples. These included the top 
differentially methylated CGI, located within the 
JMJD2B/KDM4B gene, encoding a histone demethylase 
that specifically demethylates histone H3 lysine 9, and 
highly ranked DMRs associated with the neurodevelop- 
mental genes BDNF and EOMES. We observed a highly 
significant correlation between our MeDIP-seq and pyr- 
osequencing data (correlation = 0.58, P = 7.07 x 10' ; 
Supplementary Figure 8 in Additional file 1), with highly 
significant between-tissue DNA methylation differences, 
confirming the MeDIP-seq data, being observed for all 
tested bisulfite-PCR amplicons (Figure 3). 
We observed a highly significant (% P-value = 2 x 10" 
) over-representation (observed/expected (o/e) = 1.73) 
of intragenic CGIs amongst these TS-DMRs, with a 
complementary under-representation (o/e = 0.43) of 
promoter CGIs (Figure 2c) and a highly significant dif- 
ference in average CV between these two classes of CGI 
(promoter CGI mean CV = 0.35, intragenic CGI mean 
CV = 0.60, P < 1 x 10" 10 ). Promoter CGIs are also sig- 
nificantly less-variable across tissues compared to 3' 
CGIs (mean CV = 0.59) and intergenic CGIs (mean CV 
= 0.50). A comparison of the most differentially methy- 
lated CGIs between cerebellum and cortex, which again 
are significantly enriched for pathways related to brain 
development, neurogenesis and functional specialization 
in the brain (Supplementary Table 12 in Additional file 
1), highlights an even more dramatic over-representa- 
tion of intragenic CGIs (o/e = 2.37) and under-represen- 
tation of promoter CGIs (o/e = 0.09) (% 2 P- value = 1 x 
10~ 246 ; Figure 2d). To explore the functional organiza- 
tion of DNA methylation at intragenic CGIs across cor- 
tex, cerebellum and blood, we used weighted gene co- 
methylation network analysis [35] to identify modules of 
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Figure 3 Verification and replication of MeDIP-seq data for three top-ranked CGI DMRs. (a, b) Tissue-specific DNA methylation across an 
intragenic CGI in the JMJD2B/KDM4B gene, (a) MeDIP-seq analysis shows this region is hypermethylated in blood DNA compared to cortex and 
cerebellum (the red bar depicts the region subsequently analyzed by bisulfite pyrosequencing). (b) Pyrosequencing data for this region in an 
extended sample set confirm significant tissue-specific methylation patterns (P = 2 x 10~ 8 ). (c, d) Tissue-specific DNA methylation across a CGI in 
the promoter of the EOMES gene, (c) MeDIP-seq analysis shows this region is hypermethylated in cerebellum DNA compared to cortex and 
blood, (d) Pyrosequencing data for this region in an extended sample set confirm significant tissue-specific methylation patterns (P = 2 x 10~ 5 ). 
(e, f) Tissue-specific DNA methylation across an intragenic CGI in the BDNF gene, (e) MeDIP-seq analysis shows this region is hypermethylated in 
blood DNA compared to cortex and cerebellum from the same individuals, (f) Pyrosequencing data for this region in an extended sample set 
confirm significant tissue-specific methylation patterns (P = 4 x 10~ 9 ). Error bars represent standard error of the mean. 
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co-methylated features via unsupervised hierarchical 
clustering on the basis of high topological overlap (see 
Materials and methods). Six modules were identified 
with clear tissue-specific patterns, demonstrating that 
the methylome is organized into modules of co-methy- 
lated features (Figure 4a). The strongest tissue-specific 
module (blue), composed of approximately 1,000 intra- 
genic CGIs that were hypomethylated in the cortex 
compared to cerebellum and blood (Pearson correlation 
= -0.98, P = 4 x 10' ), representing a network of genes 
involved in nervous system development and function 
(Figure 4b) and significantly enriched for functional 
pathways, including neurogenesis (P = 2.47 x 10' 25 ) and 
the differentiation of neurons (P = 6.18 x 10" 18 ). Analy- 
sis of the top 5% of genes ranked by module member- 
ship using publically available gene expression datasets 
by GeneMania [36] showed that 58.9% are also coex- 
pressed, indicating that the co-methylation networks 
identified here map onto functional gene coexpression 
networks. Furthermore, genes in this module are signifi- 
cantly enriched in relevant gene expression modules 
from published brain gene expression datasets (Supple- 
mentary Table 13 in Additional file 1). A role for differ- 
ential DNA methylation at intragenic CGIs across 
different cell types in the mouse hematopoietic lineage 
has been recently reported [37], and it has been sug- 
gested that these features may regulate transcription 
from alternative promoters across specific cell types 
[19]. 

On average, CGI shores show more tissue-specific DNA 
methylation than CGIs, but do not vary by genie location 

CGI shores have been previously shown to harbor phe- 
notypically relevant tissue-specific patterns of DNA 
methylation [21]. Compared to CGIs, we found CGI 
shores to be significantly more variable across brain 
areas and blood (average CGI CV = 0.46, average CGI 
shore CV = 0.57, P < 1 x 10" 10 ), with DNA methylation 
noticeably less correlated across tissues, especially 
between brain (cortex and cerebellum) and blood (Fig- 
ure 2b). Unlike for CGIs, however, the location of CGI 
shores makes little impact on their variability across tis- 
sues with no over-representation of intragenically 
located features (Supplementary Figure 9 in Additional 
file 1). The top 50 differentially methylated CGI shores 
between cortex, cerebellum and blood are listed in Sup- 
plementary Table 14 in Additional file 1. Strikingly, 
many are associated with genes of known neurobiologi- 
cal function, including PCDH9 (encoding a cadherin- 
related neuronal receptor that localizes to synaptic junc- 
tions, involved in specific neuronal connections and sig- 
nal transduction [38]), NGFR (encoding the receptor for 
nerve growth factor, with widespread effects on neuro- 
development [39]), AUTS2 (encoding the autism 



susceptibility candidate 2, a nuclear protein expressed in 
developing brain regions [40]), SHANK3 (encoding a 
scaffold protein involved in the structural and functional 
organization of the post-synaptic density and also impli- 
cated in autism [41]), and SLIT1 (encoding a protein 
with a key role in cortical development and synaptogen- 
esis [42]). IPA of the most variable CGI shores again 
reveals a highly significant enrichment for functional 
pathways involved in nervous system function and 
development (Supplementary Table 15 in Additional file 
1). Bisulfite pyrosequencing was used to confirm and 
replicate the two top-ranked CGI shore DMRs, which 
are highly significantly hypermethylated specifically in 
the cerebellum, flanking a hypomethylated intergenic 
CGI on chromosome 7 (Supplementary Figure 1 in 
Additional file 10). Again, our pyrosequencing data were 
significantly correlated with the MEDIPS scores across 
each of the three amplicons spanning this region (left 
shore, correlation = 0.76, P = 5.58 x 10~ 5 ; CGI, correla- 
tion = 0.80, P = 1.18 x 10~ 5 ; right shore, correlation = 
0.82, P = 4.36 x 10' 6 ), confirming the validity of the 
methylome data. 

Low CG content promoters are also characterized by 
widespread tissue-specific DNA methylation across brain 
regions and blood 

Our CGI data concur with the notion that CpG-rich 
promoters (that is, HCPs) are predominantly hypo- 
methylated and associated with ubiquitously expressed 
house-keeping genes [20,43]. We next decided to com- 
pare HCP methylation with that seen in LCPs; recent 
methylomic analyses of other tissues indicate that differ- 
ential DNA methylation across LCPs is associated with 
tissue-specific gene expression in somatic cells [20]. Our 
data provide strong evidence to support this notion, and 
like intragenic CGIs, LCPs appear to be a major location 
for tissue-specific DNA methylation signatures across 
brain regions and blood. While hierarchical clustering of 
both HCP and LCP DNA methylation can distinguish 
between tissues, the Euclidian distance between tissues 
is much larger in the case of LCPs (Figure 5a). Principal 
components analysis of our MeDIP-seq data shows a 
much stronger tissue classification based upon LCP 
methylation (Figure 5b), and correlation analyses show 
that while HCP methylation is largely conserved across 
tissues (reflecting the pattern seen for promoter CGIs), 
the correlation of LCP methylation across tissues is 
much lower. 

Between-individual differences in DNA methylation are 
correlated across brain and blood 

Because aberrant DNA methylation is being increasingly 
implicated in the etiology of complex disease phenotypes, 
including several mental health disorders [44], a key 
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Figure 4 Weighted gene co-methylation network analysis of DNA methylation at intragenic CGIs (a) Dendrograms produced by average 
inkage hierarchical clustering of intragenic CGIs on the basis of topological overlap. Modules of co-methylated loci were assigned colors as 
indicated by the horizontal bar beneath each dendrogram. The 'blue' module was strongly negatively co-methylated (r 2 = -0.98, P = 4 x 10~ 5 ) in 
cortex, (b) IPA on the genes associated with the blue module highlighted a network involved in nervous system development and function. 
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Figure 5 DNA methylation across LCPs is strongly associated 
with tissue type, (a) MEDIPS scores across both HCPs and LCPs 
can be used to accurately duster samples by tissue type, but the 
strength of clustering, indicated by Pearson dissimilarity on the y- 
axis, is much higher in LCPs. (b, c) This pattern is reflected in three- 
factor PCA plots (b) and correlation analyses (c), with LCPs 
demonstrating stronger tissue-specific patterns of DNA methylation 
than HCPs. 



question in epigenetic epidemiology concerns the extent 
to which easily accessible peripheral tissues (for example, 
whole blood) can be used to ask questions about inter- 
individual phenotypic variation manifest in inaccessible 
tissues such as the brain [45,46]. Our use of brain tissue 
and blood obtained pre-mortem from the same individuals 
enabled us to investigate the extent to which between- 
individual methylomic variation in the blood is reflected 
across the cortex and cerebellum. Comparing MeDIP-seq 
data from blood, cortex and cerebellum, we observe that 
inter-individual DNA methylation differences are highest 
in blood and lowest in the cortex, with a highly significant 
difference in variability across tissues (P < 0.001; Figure 
6a). The pattern of relative inter-individual variability by 
feature is, however, the same across tissues, with non-pro- 
moter CGIs showing a significantly higher level of 
between-individual DNA methylation differences than 
other features. Strikingly, there is a significant correlation 
between individual DNA methylation differences in the 
blood and those in brain tissue (cortex and cerebellum) 
from the same two individuals (Figure 6b), with the most 
reproducible pattern of blood-detected individual differ- 
ences observed in the cerebellum (correlation = 0.76, P < 
0.001) and slightly less correlation between blood and cor- 
tex (correlation = 0.66, P < 0.001). Although replicate 
DNA samples from each individual were assessed using 
high-resolution SNP arrays to identify potential structural 
variants and minimize the confounding effect of inter-indi- 
vidual copy-number variations (CNVs) that could manifest 
as MeDIP-seq read differences (see Materials and meth- 
ods), the influence of genomic differences between indivi- 
duals cannot be fully excluded. Supplementary Table 16 in 
Additional file 1 lists the top 50 between-individual differ- 
ences in DNA methylation identified in blood between the 
two female MeDIP-seq blood samples and lists the corre- 
sponding methylation scores from cortex and cerebellum 
from the same individuals. Interestingly, while many fea- 
tures show near-identical between-individual patterns 
across all three tissues, some examples of blood-identified 
variation are only detected in either the cortex or cerebel- 
lum. Given recent reports by us and others of widespread 
genotype-associated allele-specific DNA methylation [47], 
it is likely that many of these between-individual DNA 
methylation differences are mediated by DNA sequence 
variation that is common across tissues, although other 
mechanisms such as epigenetic changes occurring early in 
development before complete tissue differentiation could 
also be important. 

Conclusions 

We used MeDIP-seq to undertake the first genomic 
characterization of intra- and inter-individual variation 
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Figure 6 Between-individual variation in DNA methylation is often correlated between blood and brain, (a) Between-individual variation 
in DNA methylation is highest in blood and lowest in the cortex. All tissues show a similarly significant (ANOVA P < 0.001) distribution of 
variability across features, with the greatest between-individual variation occurring in non-promoter CGIs. Scores represent the mean difference 
in normalized MeDIP-seq read density between individual 1 and 2 for each of the feature categories. Error bars represent standard error of the 
mean. Features 1 to 4 = promoter, intragenic, 3', and intergenic CGIs; features 4 to 8 = promoter, intragenic, 3', and intergenic CGI shores; 
feature 9 = CDS; features 10 to 12 = HCPs, ICPs, and LCPs. (b) Between-individual differences in DNA methylation observed in blood are 
significantly (P < 0.001) correlated with differences observed in the cerebellum (correlation = 0.76) and cortex (correlation = 0.66) from the same 
individuals. Scores represent the mean difference in normalized MeDIP-seq read density between individual 1 and 2 for each of the quantified 
features. 



in DNA methylation across multiple regions of the 
human brain and an easily accessible peripheral tissue 
(whole blood). In summary, we show that between-tis- 
sue variation in DNA methylation greatly exceeds 
between-individual differences within any one tissue, 
with clear hierarchical differences in DNA methylation 
across specific brain areas, and between brain and 
blood. Interestingly, we observe that TS-DMRs are 
strikingly under-represented in classic promoter CGIs, 
being located primarily in intragenic CGIs and LCPs. 
These TS-DMRs are dramatically enriched for genes 
known to be involved in brain development and neuro- 
biological function, forming networks of co-methylated 
loci that define the cellular phenotype. Finally, 
although inter-individual differences in DNA methyla- 
tion are dwarfed by tissue-specific variation, and abso- 
lute levels of DNA methylation at specific loci clearly 
differs between cortex, cerebellum and blood, we 
observe that some between-individual variation in 



DNA methylation is correlated between brain regions 
and blood. 

To our knowledge, this study represents the most 
comprehensive cross-tissue inter-individual assessment 
of the methylomic landscape of the brain yet underta- 
ken. The data generated in this project are available as a 
resource to the genomics research community; anno- 
tated UCSC tracks can be downloaded from our labora- 
tory website [15], and the raw data are being integrated 
into the Human Epigenome Atlas [16] as part of the 
regular data release by NIH Epigenomics Roadmap 
Initiative. A primary goal of global research initiatives 
such as the International Human Epigenome Consor- 
tium and the NIH Epigenomics Roadmap is to create 
high-resolution reference epigenome maps across multi- 
ple human tissue and cell types that will expedite the 
application of epigenomic technologies to studies of 
human health and disease [18]. In this regard, and given 
increasing evidence supporting a role for epigenetic 
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disruption in neuropsychiatric disease, a unique aspect 
of this study was our use of blood and multiple brain 
regions from the same individuals, enabling us to 
address questions about the extent to which easily 
accessible peripheral tissues can be used to ask ques- 
tions about inter-individual phenotypic variation mani- 
fest in inaccessible tissues such as the brain. 

In addition to creating reference maps of DNA methy- 
lation across multiple brain regions and blood, we have 
identified key regions of the genome that are character- 
ized by tissue-specific patterns of DNA methylation. Of 
note, TS-DMRs were significantly enriched near genes 
involved in functional pathways related to neurodevelop- 
ment and neuronal differentiation, including BDNF, 
BMP4, CACNA1A, CACA1AF, EOMES, NGFR, NUMBL, 
PCDH9, SLIT1, SLITRK1, and SHANK3. Although DNA 
methylation at promoter CGIs is largely conserved 
across tissues, reflecting data from other studies, we find 
striking evidence that intragenic CGIs are a primary 
location for TS-DMRs. This builds on recent data high- 
lighting cell type-specific DNA methylation at intragenic 
CGIs in the immune system [37], and evidence that 
these features may regulate transcription from alterna- 
tive promoters across specific cell types [19]. Our data 
highlight LCPs as another region characterized by con- 
siderable cross-tissue epigenetic variation; again these 
data support recent methylomic analyses of other tis- 
sues, indicating that differential DNA methylation across 
LCPs is associated with tissue-specific gene expression 
in somatic cells [20]. Given the striking enrichment of 
tissue- relevant pathways and gene sets amongst the loci 
associated with the TS-DMRs we have identified, and 
the observation that many of these genes are differen- 
tially expressed across brain regions (Supplementary 
Table 7 and Supplementary Figure 6 in Additional file 
1), it is likely that they mediate functionally relevant dif- 
ferences in the cellular transcriptome. 

There are several limitations to this study. First, 
although it represents one of the largest cross-tissue 
methylomic analyses performed to date, the number of 
individuals profiled in our initial MeDIP-seq screen was 
relatively small. Our successful verification and replica- 
tion experiments (using bisulfite pyrosequencing and the 
Illumina 450 K array), combined with our gene expres- 
sion data, in larger number of samples, however, high- 
lights the validity of the between-tissue differences we 
observe. Second, the samples used in our initial MeDIP- 
seq screen were all obtained from donors >75 years old 
and may not fully represent patterns of DNA methyla- 
tion earlier in life. However, our observation of highly 
significant gene expression differences corresponding to 
the top 50 cortex-cerebellum DMRs in a younger cohort 
of samples (average age = 62 ± 18 years) suggests that 
many of the tissue-specific differences observed are 



developmentally stable, resulting in functional differ- 
ences at an earlier age. Future work should focus on 
identifying developmental trajectories of epigenetic 
change across multiple tissues. 

In summary, this study reinforces the importance of 
DNA methylation in regulating cellular phenotype 
across tissues, and highlights genomic patterns of epige- 
netic variation across functionally distinct regions of the 
brain. 

Materials and methods 

Sample preparation 

Post-mortem brain samples from nine elderly control 
individuals (free of neuropathological and neuropsychia- 
tric disease) were obtained from the MRC London Neu- 
rodegenerative Diseases Brain Bank. Multiple brain 
regions were dissected from each sample by a trained 
neuropathologist, snap-frozen and stored at -80°C. 
Genomic DNA was isolated from each dissected brain 
region from each sample using a standard phenol- 
chloroform extraction method, and tested for degrada- 
tion and purity prior to analysis. From a subset of indi- 
viduals whole blood samples were also obtained 
longitudinally prior to death, and DNA extracted using 
a standard phenol-chloroform method. A detailed list of 
the samples used for methylomic profiling in this study 
is given in Supplementary Table 1 in Additional file 1. 
An independent set of matched frontal cortex and cere- 
bellum samples (average age = 62 ± 18 years, 33% 
female) for gene expression analysis was obtained from 
42 additional individuals provided by the London Neu- 
rodegenerative Diseases Brain Bank. 

Methylated DNA immunoprecipitation and sequencing 

DNA was fragmented using a Covaris sonication system 
and sequencing libraries were prepared from 5 ug frag- 
mented genomic DNA. End repair, <A> base addition 
and adaptor ligation steps were performed using Illumi- 
na's Paired-End DNA Sample Prep kit. Adaptor-ligated 
DNA was immunoprecipitated by anti-5mC using a 
commercial antibody (Diagenode, Liege, Belgium), and 
MeDIP products were validated by quantitative PCR. 
MeDIP DNA was purified with ZYMO DNA Clean & 
Concentrator-5 columns, and amplified using adaptor- 
mediated PCR. DNA fragments between 220 and 320 bp 
in size were gel-excised, and amplification quality and 
quantity were evaluated by Agilent BioAnalyzer analysis. 
The libraries were subjected to highly parallel 50 bp 
paired-end sequencing on the Illumina Hi-Seq platform. 

Sequencing quality control and alignment 

From the raw fastq files, Illumina quality scores were 
converted into Sanger Phred quality scores using MAQ 
[48]. Quality control was performed on the raw 
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sequence data using FastQC [49]. Supplementary Figure 
1 in Additional file 1 shows FastQC output for one 
representative sample highlighting the high quality 
sequencing data obtained. Alignment to hgl8 was per- 
formed using the Burrows- Wheeler algorithm. 

Correcting for local CpG densities 

The MEDIPS package [14] was used to calculate methy- 
lation scores by incorporating a coupling factor based 
on local CpG density. Bin sizes of 500 bp were defined 
across the genome, with an overlap of 250 bp. The 
number of CpGs within the maximal defined distance 
around the genomic bin was calculated and a calibration 
curve determined relative to the dependency of local 
MeDIP-seq signal intensities and local CpG densities. 

DMR calling across known features using normalized read 
counts 

Mapped reads were also quantified using SeqMonk 
(Babraham Institute, Cambridge, UK). BED files of the 
feature annotations used in this study (CGIs, CGI 
shores, CDS, LCPs, ICPs, and HCPs) are available for 
download from our laboratory webpage [15]. Read- 
depth scores were generated for each feature, normal- 
ized for total read count and feature length. For each 
feature, the CV across tissues/samples was calculated 
(Standard deviation/ (Mean + 1)) and ranked. 

Weighted gene co-methylation network analysis 

We employed weighted gene co-methylation network 
analysis [35,50,51] as described in R [52] to find 
weighted signed co-methylation networks (modules) in 
the cortex, cerebellum and peripheral blood using log 
transformed MEDIPS scores. For each genomic feature 
we performed hierarchical clustering of the samples, 
based on Pearson correlation, and mapped the final 
sample dendrogram to three traits (cortex, cerebellum 
and blood) to describe the sample-trait relationship. 
For each genomic feature type we performed hierarchi- 
cal clustering of the topological overlap matrix. Leaves 
of the tree were grouped into modules, which is a clus- 
ter of highly co-methylated genomic locations (GL). 
After finding the modules, the next step was to 
describe the relationship between modules and each 
tissue (that is, cortex, cerebellum and peripheral 
blood). For this we calculated the Pearson correlation 
coefficient for the module representative also known as 
module eigengene (ME), which is the first principal 
component of each module's methylation profile, and 
each trait. The GL-trait relation (or gene significance 
GS if thinking in terms of gene expression) was 
defined as (the absolute value of) the correlation 
between the GL methylation profile and the trait. For 
each module, we also calculated a quantitative measure 



of module membership as (the absolute value of) the 
correlation of the ME and the GL methylation profile. 
This measure allows us to assess the similarity between 
a module's DNA methylation profile and DNA methy- 
lation at a genomic location. 

Bisulfite pyrosequencing analysis 

Genomic DNA (0.5 ug), extracted from dissected brain 
samples and blood (Supplementary Table 1 in Addi- 
tional file 1), was bisulfite converted using the EZ 96- 
DNA methylation kit (Zymo Research, Irvine, CA, USA) 
following the manufacturer's standard protocol. Fully 
methylated and unmethylated samples were included 
throughout the experimental procedure as assay con- 
trols. Pyrosequencing assays were designed using the 
Pyromark Assay Design Software (Qiagen, Crawley, UK). 
Bisulfite-PCR amplification was performed in duplicate 
using Hot Star Taq DNA polymerase (Qiagen, UK) and 
optimized cycling conditions. Pyrosequencing was per- 
formed using the Pyromark Q24 (Qiagen, UK). A full 
list of bisulfite PCR and sequencing primers is given in 
Supplementary Table 17 in Additional file 1. 

Gene expression analysis 

RNA was extracted from matched cerebellum and 
frontal cortex samples from 42 LBBND donors using 
the Trizol extraction method and purified using an 
RNeasy Mini Kit with DNase I digestion (Qiagen, UK) 
according to the manufacturer's instructions. RNA was 
tested for purity and degradation using an Agilent 
2100 Bioanalyzer and RNA 6000 Nano kit (Agilent 
Technologies, Wokingham, UK). RNA was biotinylated 
and amplified using the Illumina TotalPrep™ RNA 
Amplification kit (Life Technologies, Paisley, UK). 
Gene expression was assessed using Illumina 
HumanHT-12 v4 microarrays (Illumina, San Diego, 
California, USA) according to the standard manufac- 
turer's protocol. Following scanning, signal intensities 
for each probe were extracted using Illumina Geno- 
meStudio and imported into R using the Lumi package 
within Bioconductor. Probes relating to the top 50 cer- 
ebellum-cortex DMRs were tested for expression dif- 
ferences between cerebellum and cortex. 

Genotyping arrays and CNV analysis 

For the individuals assessed by MeDIP-seq, Affymetrix 
SNP 6.0 arrays (Affymetrix, High Wycombe, UK) were 
used to genotype samples from two tissues according to 
the manufacturers' standard protocol. CNVs were iden- 
tified using the PennCNV program [53]. Briefly, this 
implements a hidden Markov model that integrates mul- 
tiple sources of information to infer CNV calls for indi- 
vidual genotyped samples. CNVs were stringently called 
by comparing the duplicate arrays from each individual. 
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Data availability 

The methylomic data generated in this project is avail- 
able for browsing as UCSC Genome Browser tracks 
from our laboratory website [15]. Raw data have also 
been deposited into the Human Epigenome Atlas 
[16,17] and will be integrated into this resource as part 
of the next data release by NIH Epigenomics Roadmap 
Initiative. 

Additional material 



Additional file 1: Supplementary figures and tables This file contains 
Supplementary Figures 1 to 10 and Supplementary Tables 1 to 17. 
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